智能论文笔记

Visual Detection of Personal Protective Equipment and Safety Gear on Industry Workers

Jonathan Karlsson , Fredrik Strand , Josef Bigun , Fernando Alonso-Fernandez , Kevin Hernandez-Diaz , Felix Nilsson

分类：计算机视觉

2022-12-09

Workplace injuries are common in today's society due to a lack of adequately worn safety equipment. A system that only admits appropriately equipped personnel can be created to improve working conditions. The goal is thus to develop a system that will improve workers' safety using a camera that will detect the usage of Personal Protective Equipment (PPE). To this end, we collected and labeled appropriate data from several public sources, which have been used to train and evaluate several models based on the popular YOLOv4 object detector. Our focus, driven by a collaborating industrial partner, is to implement our system into an entry control point where workers must present themselves to obtain access to a restricted area. Combined with facial identity recognition, the system would ensure that only authorized people wearing appropriate equipment are granted access. A novelty of this work is that we increase the number of classes to five objects (hardhat, safety vest, safety gloves, safety glasses, and hearing protection), whereas most existing works only focus on one or two classes, usually hardhats or vests. The AI model developed provides good detection accuracy at a distance of 3 and 5 meters in the collaborative environment where we aim at operating (mAP of 99/89%, respectively). The small size of some objects or the potential occlusion by body parts have been identified as potential factors that are detrimental to accuracy, which we have counteracted via data augmentation and cropping of the body before applying PPE detection.

translated by 谷歌翻译

High-resolution synthesis of high-density breast mammograms: Application to improved fairness in deep learning based mass detection

Lidia Garrucho , Kaisar Kushibar , Richard Osuala , Oliver Diaz , Alessandro Catanese , Javier del Riego , Maciej Bobowicz , Fredrik Strand , Laura Igual , Karim Lekadir

分类：计算机视觉

2022-09-20

基于深度学习的计算机辅助检测系统在乳腺癌检测中表现出良好的性能。但是，高密度的乳房显示出较差的检测性能，因为密集组织可以掩盖甚至模拟质量。因此，乳腺癌检测的敏感性可在致密乳房中降低20％以上。此外，与低密度乳房相比，极度致密的病例报告说，患癌症的风险增加。这项研究旨在使用合成高密度的全场数字乳房X线照片（FFDM）作为乳腺质量检测模型训练期间的数据增强来提高高密度乳房的质量检测性能。为此，对使用三个FFDM数据集进行了五个周期一致的GAN（CycleGAN）模型，以高分辨率乳房X线照片中的低密度图像翻译进行了训练。训练图像是由乳房密度双拉德类别分开的，几乎是脂肪的脂肪，双刺是乳房的乳房。我们的结果表明，所提出的数据增强技术在两个不同的测试集中提高了高密度乳房中质量检测的敏感性和精度，并将其作为域适应技术有用。此外，在一项涉及两名专家放射科医生和一名外科肿瘤学家的读者研究中评估了合成图像的临床现实主义。

translated by 谷歌翻译

PatchDropout: Economizing Vision Transformers Using Patch Dropout

Yue Liu , Christos Matsoukas , Fredrik Strand , Hossein Azizpour , Kevin Smith

分类：计算机视觉 | 机器学习

2022-08-10

视觉变压器已经证明了在各种视觉任务中胜过CNN的潜力。但是这些模型的计算和内存要求禁止在许多应用中使用它们，尤其是依赖高分辨率图像的应用程序，例如医学图像分类。更有效地训练VIT的努力过于复杂，需要进行建筑变化或复杂的培训方案。在这项工作中，我们表明可以通过随机删除输入图像贴片来有效地以高分辨率进行标准VIT模型。这种简单的方法（PatchDropout）在标准的自然图像数据集（例如ImageNet）中将拖鞋和内存减少至少50％，而这些节省仅随图像尺寸而增加。在高分辨率医疗数据集CSAW上，我们使用PatchDropout可节省5倍的计算和内存，并提高性能。对于具有固定计算或内存预算的从业人员，PatchDropout可以选择图像分辨率，超参数或模型大小以使其从模型中获得最大的性能。

translated by 谷歌翻译

CSAW-M: An Ordinal Classification Dataset for Benchmarking Mammographic Masking of Cancer

Moein Sorkhei , Yue Liu , Hossein Azizpour , Edward Azavedo , Karin Dembrower , Dimitra Ntoula , Athanasios Zouzos , Fredrik Strand , Kevin Smith

分类：计算机视觉 | 机器学习

2021-12-02

由于筛选乳房X线照片的假阴性评估，通常在晚期检测到与其他癌症更差的间隔和大型侵入性乳腺癌。错过的筛选时间检测通常由其周围乳腺组织模糊的肿瘤引起的，这是一种称为掩蔽的现象。为了研究和基准爆发癌症的乳房Xmmpare掩蔽，在这项工作中，我们引入CSAW-M，最大的公共乳房数据集，从10,000多个人收集并用潜在的掩蔽注释。与以前的方法对比测量乳房图像密度作为代理的方法，我们的数据集直接提供了五个专家屏蔽潜在评估的注释。我们还培训了CSAW-M的深入学习模型来估计掩蔽水平，并显示估计的掩蔽更加预测筛查患有间隔和大型侵入性癌症的参与者 - 而不是明确培训这些任务 - 而不是其乳房密度同行。

translated by 谷歌翻译

ECG-Based Electrolyte Prediction: Evaluating Regression and Probabilistic Methods

Philipp Von Bachmann , Daniel Gedon , Fredrik K. Gustafsson , Antônio H. Ribeiro , Erik Lampa , Stefan Gustafsson , Johan Sundström , Thomas B. Schön

分类：计算机视觉 | 机器学习

2022-12-21

Objective: Imbalances of the electrolyte concentration levels in the body can lead to catastrophic consequences, but accurate and accessible measurements could improve patient outcomes. While blood tests provide accurate measurements, they are invasive and the laboratory analysis can be slow or inaccessible. In contrast, an electrocardiogram (ECG) is a widely adopted tool which is quick and simple to acquire. However, the problem of estimating continuous electrolyte concentrations directly from ECGs is not well-studied. We therefore investigate if regression methods can be used for accurate ECG-based prediction of electrolyte concentrations. Methods: We explore the use of deep neural networks (DNNs) for this task. We analyze the regression performance across four electrolytes, utilizing a novel dataset containing over 290000 ECGs. For improved understanding, we also study the full spectrum from continuous predictions to binary classification of extreme concentration levels. To enhance clinical usefulness, we finally extend to a probabilistic regression approach and evaluate different uncertainty estimates. Results: We find that the performance varies significantly between different electrolytes, which is clinically justified in the interplay of electrolytes and their manifestation in the ECG. We also compare the regression accuracy with that of traditional machine learning models, demonstrating superior performance of DNNs. Conclusion: Discretization can lead to good classification performance, but does not help solve the original problem of predicting continuous concentration levels. While probabilistic regression demonstrates potential practical usefulness, the uncertainty estimates are not particularly well-calibrated. Significance: Our study is a first step towards accurate and reliable ECG-based prediction of electrolyte concentration levels.

translated by 谷歌翻译

In Search of Projectively Equivariant Neural Networks

Georg Bökman , Axel Flinth , Fredrik Kahl

分类：计算机视觉

2022-09-29

线性神经网络层的模棱两可。在这项工作中，我们放宽了肩variance条件，只有在投影范围内才是真实的。特别是，我们研究了投射性和普通的肩那样的关系，并表明对于重要的例子，这些问题实际上是等效的。3D中的旋转组在投影平面上投影起作用。在设计用于过滤2D-2D对应的网络时，我们在实验上研究了旋转肩位的实际重要性。完全模型的模型表现不佳，虽然简单地增加了不变的特征，从而在强大的基线产量中得到了改善，但这似乎并不是由于改善的均衡性。

translated by 谷歌翻译

Efficient learning of nonlinear prediction models with time-series privileged information

Bastian Jung , Fredrik D Johansson

分类：机器学习 | (统计)机器学习

2022-09-15

在样本量有限的域中，有效的学习算法至关重要。使用特权信息（LUPI）学习，通过允许预测模型在培训时间访问信息类型，从而提高了样本效率，而在使用模型时，这是不可用的。在最近的工作中，有证据表明，对于线性高斯动力学系统的预测，具有中间时间序列数据访问的卢比学习者永远不会比任何公正的经典学习者更糟糕，而且常常更好。我们为该分析提供了新的见解，并将其推广到潜在动力学系统中的非线性预测任务，从而将理论保证扩展到连接潜在变量和观察值的地图已知到线性变换的情况下。此外，我们提出了基于随机特征和表示该地图未知的情况的表示算法。一套经验结果证实了理论发现，并显示了在非线性预测中使用特权时间序列信息的潜力。

translated by 谷歌翻译

Identifying and Mitigating Flaws of Deep Perceptual Similarity Metrics

Oskar Sjögren , Gustav Grund Pihlgren , Fredrik Sandin , Marcus Liwicki

分类：计算机视觉

2022-07-06

衡量图像的相似性是计算机视觉的基本问题，不存在通用解决方案。尽管已显示出像素的L2-Norm这样的简单指标，例如L2-Norm具有很大的缺陷，但它们仍然受欢迎。一组最新的最新指标减轻了其中一些缺陷是深度的知觉相似性（DPS）指标，其中将相似性评估为神经网络深度特征的距离。但是，DPS指标本身还没有彻底检查其利益，尤其是其缺陷。这项工作研究了最常见的DPS度量，其中通过空间位置进行了比较的深度特征，并比较了平均和排序的深度特征。对指标进行了深入分析，以通过使用专门挑战它们的图像来了解指标的优势和劣势。这项工作为DPS的缺陷提供了新的见解，并进一步提出了对指标的改进。这项工作的实施可在线获得：https：//github.com/guspih/deep_perceptual_similarity_analysis/

translated by 谷歌翻译

Drone Detection and Tracking in Real-Time by Fusion of Different Sensing Modalities

Fredrik Svanström , Fernando Alonso-Fernandez , Cristofer Englund

分类：计算机视觉

2022-07-05

自动检测飞行无人机是一个关键问题，其存在（特别是未经授权）可以造成风险的情况或损害安全性。在这里，我们设计和评估了多传感器无人机检测系统。结合常见的摄像机和麦克风传感器，我们探索了热红外摄像机的使用，指出是一种可行且有希望的解决方案，在相关文献中几乎没有解决。我们的解决方案还集成了鱼眼相机，以监视天空的更大部分，并将其他摄像机转向感兴趣的对象。传感溶液与ADS-B接收器，GPS接收器和雷达模块相辅相成，尽管由于其有限的检测范围，后者未包含在我们的最终部署中。即使此处使用的摄像机的分辨率较低，热摄像机也被证明是与摄像机一样好的可行解决方案。我们作品的另外两个新颖性是创建一个新的公共数据集的多传感器注释数据，该数据与现有的类别相比扩大了类的数量，以及对探测器性能的研究作为传感器到传感器的函数的研究目标距离。还探索了传感器融合，表明可以以这种方式使系统更强大，从而减轻对单个传感器的虚假检测

translated by 谷歌翻译

Sharing pattern submodels for prediction with missing values

Lena Stempfle , Fredrik Johansson

分类：机器学习 | (统计)机器学习

2022-06-22

在机器学习的许多应用中，不可避免的值是不可避免的，并且在培训和测试时都提出了挑战。当反复出现的模式中缺少变量时，已经提出了单独的图案子模型作为解决方案。但是，独立模型并不能有效利用所有可用数据。相反，将共享模型拟合到完整数据集通常取决于插补，而当丢失度取决于未观察到的因素时，这可能是次优的。我们提出了一种替代方法，称为共享模式子模型，该方法做出了a）在测试时对缺失值的强大预测，b）维持或提高模式子模型的预测能力，c）有一个简短的描述，可改善可解释性。我们确定共享是最佳的情况，即使缺失本身具有预测性以及预测目标取决于未观察到的变量。关于合成数据和两个医疗保健数据集的分类和回归实验表明，我们的模型在模式专业化和信息共享之间实现了良好的权衡。

translated by 谷歌翻译